Overview
Brought to you by YData
Dataset statistics
| Training Data | Original Data | |
|---|---|---|
| Number of variables | 12 | 12 |
| Number of observations | 593994 | 20000 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Duplicate rows | 0 | 0 |
| Duplicate rows (%) | 0.0% | 0.0% |
| Total size in memory | 54.4 MiB | 1.8 MiB |
| Average record size in memory | 96.0 B | 96.0 B |
Variable types
| Training Data | Original Data | |
|---|---|---|
| Numeric | 5 | 5 |
| Categorical | 7 | 7 |
| Training Data | Original Data | |
|---|---|---|
credit_score is highly overall correlated with grade_subgrade and 1 other fields | credit_score is highly overall correlated with grade_subgrade and 1 other fields | High correlation |
employment_status is highly overall correlated with loan_paid_back | employment_status is highly overall correlated with loan_paid_back | High correlation |
grade_subgrade is highly overall correlated with credit_score | grade_subgrade is highly overall correlated with credit_score | High correlation |
interest_rate is highly overall correlated with credit_score | interest_rate is highly overall correlated with credit_score | High correlation |
loan_paid_back is highly overall correlated with employment_status | loan_paid_back is highly overall correlated with employment_status | High correlation |
Reproduction
| Training Data | Original Data | |
|---|---|---|
| Analysis started | 2025-11-14 17:00:20.786996 | 2025-11-14 17:00:51.510318 |
| Analysis finished | 2025-11-14 17:00:38.380925 | 2025-11-14 17:00:56.083976 |
| Duration | 17.59 seconds | 4.57 seconds |
| Software version | ydata-profiling vv4.17.0 | ydata-profiling vv4.17.0 |
| Download configuration | config.json | config.json |
Variables
annual_income
Real number (ℝ)
| Training Data | Original Data | |
|---|---|---|
| Distinct | 119728 | 19947 |
| Distinct (%) | 20.2% | 99.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 48212.203 | 43549.638 |
| Training Data | Original Data | |
|---|---|---|
| Minimum | 6002.43 | 6000 |
| Maximum | 393381.74 | 400000 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
Quantile statistics
| Training Data | Original Data | |
|---|---|---|
| Minimum | 6002.43 | 6000 |
| 5-th percentile | 15450.11 | 13377.715 |
| Q1 | 27934.4 | 24260.753 |
| median | 46557.68 | 36585.26 |
| Q3 | 60981.32 | 54677.917 |
| 95-th percentile | 93534.68 | 97663.691 |
| Maximum | 393381.74 | 400000 |
| Range | 387379.31 | 394000 |
| Interquartile range (IQR) | 33046.92 | 30417.165 |
Descriptive statistics
| Training Data | Original Data | |
|---|---|---|
| Standard deviation | 26711.942 | 28668.58 |
| Coefficient of variation (CV) | 0.5540494 | 0.65829663 |
| Kurtosis | 7.0914126 | 10.952957 |
| Mean | 48212.203 | 43549.638 |
| Median Absolute Deviation (MAD) | 17068.9 | 14200.04 |
| Skewness | 1.7195087 | 2.3307532 |
| Sum | 2.8637759 × 1010 | 8.7099276 × 108 |
| Variance | 7.1352785 × 108 | 8.2188746 × 108 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 51351.71 | 238 | < 0.1% |
| 25499.88 | 227 | < 0.1% |
| 24113.12 | 219 | < 0.1% |
| 56547.75 | 209 | < 0.1% |
| 26386.33 | 187 | < 0.1% |
| 28991.07 | 185 | < 0.1% |
| 16077.08 | 170 | < 0.1% |
| 46949.29 | 160 | < 0.1% |
| 53981.9 | 152 | < 0.1% |
| 52628.69 | 146 | < 0.1% |
| Other values (119718) | 592101 |
| Value | Count | Frequency (%) |
| 6000 | 26 | 0.1% |
| 28316.41 | 2 | < 0.1% |
| 26386.33 | 2 | < 0.1% |
| 25860.67 | 2 | < 0.1% |
| 18205.78 | 2 | < 0.1% |
| 40010.06 | 2 | < 0.1% |
| 16664.34 | 2 | < 0.1% |
| 17306.58 | 2 | < 0.1% |
| 56547.75 | 2 | < 0.1% |
| 36822.03 | 2 | < 0.1% |
| Other values (19937) | 19956 |
| Value | Count | Frequency (%) |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6026.31 | 3 | |
| 6026.47 | 1 | < 0.1% |
| 6026.71 | 1 | < 0.1% |
| 6064.78 | 1 | < 0.1% |
| 6071.69 | 1 | < 0.1% |
| 6073.15 | 1 | < 0.1% |
| 6074.92 | 1 | < 0.1% |
| 6093.55 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6000 | 26 | |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6018.9 | 1 | < 0.1% |
| 6026.31 | 1 | < 0.1% |
| 6100.32 | 1 | < 0.1% |
| 6105.99 | 1 | < 0.1% |
| 6109.87 | 1 | < 0.1% |
| 6151.16 | 1 | < 0.1% |
| 6166.42 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6000 | 26 | |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6018.9 | 1 | < 0.1% |
| 6026.31 | 1 | < 0.1% |
| 6100.32 | 1 | < 0.1% |
| 6105.99 | 1 | < 0.1% |
| 6109.87 | 1 | < 0.1% |
| 6151.16 | 1 | < 0.1% |
| 6166.42 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6026.31 | 3 | |
| 6026.47 | 1 | < 0.1% |
| 6026.71 | 1 | < 0.1% |
| 6064.78 | 1 | < 0.1% |
| 6071.69 | 1 | < 0.1% |
| 6073.15 | 1 | < 0.1% |
| 6074.92 | 1 | < 0.1% |
| 6093.55 | 1 | < 0.1% |
debt_to_income_ratio
Real number (ℝ)
| Training Data | Original Data | |
|---|---|---|
| Distinct | 526 | 555 |
| Distinct (%) | 0.1% | 2.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.12069589 | 0.1770193 |
| Training Data | Original Data | |
|---|---|---|
| Minimum | 0.011 | 0.01 |
| Maximum | 0.627 | 0.667 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
Quantile statistics
| Training Data | Original Data | |
|---|---|---|
| Minimum | 0.011 | 0.01 |
| 5-th percentile | 0.046 | 0.037 |
| Q1 | 0.072 | 0.096 |
| median | 0.096 | 0.16 |
| Q3 | 0.156 | 0.241 |
| 95-th percentile | 0.259 | 0.376 |
| Maximum | 0.627 | 0.667 |
| Range | 0.616 | 0.657 |
| Interquartile range (IQR) | 0.084 | 0.145 |
Descriptive statistics
| Training Data | Original Data | |
|---|---|---|
| Standard deviation | 0.068573259 | 0.10505934 |
| Coefficient of variation (CV) | 0.56814907 | 0.59349087 |
| Kurtosis | 2.33523 | 0.36506797 |
| Mean | 0.12069589 | 0.1770193 |
| Median Absolute Deviation (MAD) | 0.032 | 0.07 |
| Skewness | 1.4066799 | 0.78773924 |
| Sum | 71692.635 | 3540.386 |
| Variance | 0.0047022918 | 0.011037464 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.09 | 11440 | 1.9% |
| 0.093 | 11160 | 1.9% |
| 0.097 | 9508 | 1.6% |
| 0.079 | 9099 | 1.5% |
| 0.094 | 8976 | 1.5% |
| 0.098 | 8647 | 1.5% |
| 0.071 | 8192 | 1.4% |
| 0.096 | 7715 | 1.3% |
| 0.063 | 7579 | 1.3% |
| 0.067 | 7373 | 1.2% |
| Other values (516) | 504305 |
| Value | Count | Frequency (%) |
| 0.093 | 101 | 0.5% |
| 0.09 | 96 | 0.5% |
| 0.116 | 92 | 0.5% |
| 0.097 | 91 | 0.5% |
| 0.11 | 91 | 0.5% |
| 0.079 | 90 | 0.4% |
| 0.13 | 90 | 0.4% |
| 0.094 | 89 | 0.4% |
| 0.063 | 89 | 0.4% |
| 0.12 | 88 | 0.4% |
| Other values (545) | 19083 |
| Value | Count | Frequency (%) |
| 0.011 | 169 | |
| 0.012 | 55 | < 0.1% |
| 0.013 | 127 | |
| 0.014 | 243 | |
| 0.015 | 138 | |
| 0.016 | 80 | < 0.1% |
| 0.017 | 205 | |
| 0.018 | 186 | |
| 0.019 | 61 | < 0.1% |
| 0.02 | 152 |
| Value | Count | Frequency (%) |
| 0.01 | 84 | |
| 0.011 | 26 | 0.1% |
| 0.012 | 15 | 0.1% |
| 0.013 | 21 | 0.1% |
| 0.014 | 29 | 0.1% |
| 0.015 | 26 | 0.1% |
| 0.016 | 24 | 0.1% |
| 0.017 | 29 | 0.1% |
| 0.018 | 26 | 0.1% |
| 0.019 | 19 | 0.1% |
| Value | Count | Frequency (%) |
| 0.01 | 84 | |
| 0.011 | 26 | < 0.1% |
| 0.012 | 15 | < 0.1% |
| 0.013 | 21 | < 0.1% |
| 0.014 | 29 | < 0.1% |
| 0.015 | 26 | < 0.1% |
| 0.016 | 24 | < 0.1% |
| 0.017 | 29 | < 0.1% |
| 0.018 | 26 | < 0.1% |
| 0.019 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.011 | 169 | |
| 0.012 | 55 | 0.3% |
| 0.013 | 127 | |
| 0.014 | 243 | |
| 0.015 | 138 | |
| 0.016 | 80 | 0.4% |
| 0.017 | 205 | |
| 0.018 | 186 | |
| 0.019 | 61 | 0.3% |
| 0.02 | 152 |
credit_score
Real number (ℝ)
| Training Data | Original Data | |
|---|---|---|
| Distinct | 399 | 399 |
| Distinct (%) | 0.1% | 2.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 680.91601 | 679.25695 |
| Training Data | Original Data | |
|---|---|---|
| Minimum | 395 | 373 |
| Maximum | 849 | 850 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
Quantile statistics
| Training Data | Original Data | |
|---|---|---|
| Minimum | 395 | 373 |
| 5-th percentile | 582 | 565 |
| Q1 | 646 | 632 |
| median | 682 | 680 |
| Q3 | 719 | 727 |
| 95-th percentile | 767 | 794 |
| Maximum | 849 | 850 |
| Range | 454 | 477 |
| Interquartile range (IQR) | 73 | 95 |
Descriptive statistics
| Training Data | Original Data | |
|---|---|---|
| Standard deviation | 55.424956 | 69.63858 |
| Coefficient of variation (CV) | 0.08139764 | 0.1025217 |
| Kurtosis | 0.09596164 | -0.13134364 |
| Mean | 680.91601 | 679.25695 |
| Median Absolute Deviation (MAD) | 36 | 47 |
| Skewness | -0.16699288 | -0.070714162 |
| Sum | 4.0446002 × 108 | 13585139 |
| Variance | 3071.9257 | 4849.5318 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 678 | 6526 | 1.1% |
| 661 | 5801 | 1.0% |
| 674 | 5793 | 1.0% |
| 708 | 5661 | 1.0% |
| 681 | 5635 | 0.9% |
| 672 | 5622 | 0.9% |
| 669 | 5618 | 0.9% |
| 685 | 5557 | 0.9% |
| 713 | 5544 | 0.9% |
| 676 | 5508 | 0.9% |
| Other values (389) | 536729 |
| Value | Count | Frequency (%) |
| 850 | 141 | 0.7% |
| 678 | 135 | 0.7% |
| 669 | 129 | 0.6% |
| 683 | 127 | 0.6% |
| 661 | 127 | 0.6% |
| 708 | 127 | 0.6% |
| 685 | 126 | 0.6% |
| 672 | 123 | 0.6% |
| 688 | 123 | 0.6% |
| 703 | 123 | 0.6% |
| Other values (389) | 18719 |
| Value | Count | Frequency (%) |
| 395 | 2 | |
| 431 | 1 | < 0.1% |
| 435 | 2 | |
| 437 | 3 | |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 445 | 4 | |
| 446 | 2 | |
| 447 | 2 |
| Value | Count | Frequency (%) |
| 373 | 1 | < 0.1% |
| 395 | 1 | < 0.1% |
| 435 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 445 | 3 | |
| 447 | 1 | < 0.1% |
| 448 | 2 |
| Value | Count | Frequency (%) |
| 373 | 1 | < 0.1% |
| 395 | 1 | < 0.1% |
| 435 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 445 | 3 | |
| 447 | 1 | < 0.1% |
| 448 | 2 |
| Value | Count | Frequency (%) |
| 395 | 2 | |
| 431 | 1 | < 0.1% |
| 435 | 2 | |
| 437 | 3 | |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 445 | 4 | |
| 446 | 2 | |
| 447 | 2 |
loan_amount
Real number (ℝ)
| Training Data | Original Data | |
|---|---|---|
| Distinct | 111570 | 18819 |
| Distinct (%) | 18.8% | 94.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 15020.298 | 15129.301 |
| Training Data | Original Data | |
|---|---|---|
| Minimum | 500.09 | 500 |
| Maximum | 48959.95 | 49039.69 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
Quantile statistics
| Training Data | Original Data | |
|---|---|---|
| Minimum | 500.09 | 500 |
| 5-th percentile | 3139.37 | 500 |
| Q1 | 10279.62 | 8852.695 |
| median | 15000.22 | 14946.17 |
| Q3 | 18858.58 | 20998.868 |
| 95-th percentile | 27139.83 | 29724.311 |
| Maximum | 48959.95 | 49039.69 |
| Range | 48459.86 | 48539.69 |
| Interquartile range (IQR) | 8578.96 | 12146.173 |
Descriptive statistics
| Training Data | Original Data | |
|---|---|---|
| Standard deviation | 6926.5306 | 8605.4055 |
| Coefficient of variation (CV) | 0.46114469 | 0.56879069 |
| Kurtosis | -0.15014223 | -0.32712232 |
| Mean | 15020.298 | 15129.301 |
| Median Absolute Deviation (MAD) | 4386.47 | 6070.715 |
| Skewness | 0.20735982 | 0.25309257 |
| Sum | 8.9219667 × 109 | 3.0258602 × 108 |
| Variance | 47976826 | 74053004 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12892.25 | 412 | 0.1% |
| 15212.88 | 338 | 0.1% |
| 16004.97 | 282 | < 0.1% |
| 1838.88 | 278 | < 0.1% |
| 17051.01 | 255 | < 0.1% |
| 15011.15 | 250 | < 0.1% |
| 18078.57 | 241 | < 0.1% |
| 12551.14 | 241 | < 0.1% |
| 18054.98 | 237 | < 0.1% |
| 8146.24 | 232 | < 0.1% |
| Other values (111560) | 591228 |
| Value | Count | Frequency (%) |
| 500 | 1120 | 5.6% |
| 11930.42 | 2 | < 0.1% |
| 15222.83 | 2 | < 0.1% |
| 8354.29 | 2 | < 0.1% |
| 10439.05 | 2 | < 0.1% |
| 14988.46 | 2 | < 0.1% |
| 15212.88 | 2 | < 0.1% |
| 5844.81 | 2 | < 0.1% |
| 16181.65 | 2 | < 0.1% |
| 19010.18 | 2 | < 0.1% |
| Other values (18809) | 18862 |
| Value | Count | Frequency (%) |
| 500.09 | 1 | < 0.1% |
| 500.37 | 1 | < 0.1% |
| 500.91 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.41 | 1 | < 0.1% |
| 507.42 | 1 | < 0.1% |
| 507.46 | 3 | |
| 507.86 | 1 | < 0.1% |
| 508.34 | 1 | < 0.1% |
| 508.35 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500 | 1120 | |
| 502.91 | 1 | < 0.1% |
| 507.46 | 1 | < 0.1% |
| 512.53 | 1 | < 0.1% |
| 514.5 | 1 | < 0.1% |
| 515.52 | 1 | < 0.1% |
| 517.14 | 1 | < 0.1% |
| 518.18 | 1 | < 0.1% |
| 524.41 | 1 | < 0.1% |
| 525.03 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500 | 1120 | |
| 502.91 | 1 | < 0.1% |
| 507.46 | 1 | < 0.1% |
| 512.53 | 1 | < 0.1% |
| 514.5 | 1 | < 0.1% |
| 515.52 | 1 | < 0.1% |
| 517.14 | 1 | < 0.1% |
| 518.18 | 1 | < 0.1% |
| 524.41 | 1 | < 0.1% |
| 525.03 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500.09 | 1 | < 0.1% |
| 500.37 | 1 | < 0.1% |
| 500.91 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.41 | 1 | < 0.1% |
| 507.42 | 1 | < 0.1% |
| 507.46 | 3 | |
| 507.86 | 1 | < 0.1% |
| 508.34 | 1 | < 0.1% |
| 508.35 | 1 | < 0.1% |
interest_rate
Real number (ℝ)
| Training Data | Original Data | |
|---|---|---|
| Distinct | 1454 | 1365 |
| Distinct (%) | 0.2% | 6.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 12.356345 | 12.400627 |
| Training Data | Original Data | |
|---|---|---|
| Minimum | 3.2 | 3.14 |
| Maximum | 20.99 | 22.51 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
Quantile statistics
| Training Data | Original Data | |
|---|---|---|
| Minimum | 3.2 | 3.14 |
| 5-th percentile | 9.1 | 8.43 |
| Q1 | 10.99 | 10.74 |
| median | 12.37 | 12.4 |
| Q3 | 13.68 | 14.0025 |
| 95-th percentile | 15.72 | 16.48 |
| Maximum | 20.99 | 22.51 |
| Range | 17.79 | 19.37 |
| Interquartile range (IQR) | 2.69 | 3.2625 |
Descriptive statistics
| Training Data | Original Data | |
|---|---|---|
| Standard deviation | 2.0089589 | 2.4427288 |
| Coefficient of variation (CV) | 0.1625852 | 0.19698431 |
| Kurtosis | 0.059797501 | -0.01614091 |
| Mean | 12.356345 | 12.400627 |
| Median Absolute Deviation (MAD) | 1.34 | 1.63 |
| Skewness | 0.049945315 | 0.027425966 |
| Sum | 7339594.9 | 248012.53 |
| Variance | 4.0359159 | 5.9669241 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12.31 | 2638 | 0.4% |
| 12.52 | 2436 | 0.4% |
| 13.35 | 2415 | 0.4% |
| 12.82 | 2406 | 0.4% |
| 12.23 | 2362 | 0.4% |
| 11.26 | 2318 | 0.4% |
| 11.6 | 2236 | 0.4% |
| 13.78 | 2222 | 0.4% |
| 12.09 | 2215 | 0.4% |
| 12.81 | 2209 | 0.4% |
| Other values (1444) | 570537 |
| Value | Count | Frequency (%) |
| 12.31 | 46 | 0.2% |
| 13.78 | 45 | 0.2% |
| 11.26 | 45 | 0.2% |
| 12.82 | 45 | 0.2% |
| 12.52 | 44 | 0.2% |
| 12.23 | 44 | 0.2% |
| 12.09 | 44 | 0.2% |
| 13.35 | 43 | 0.2% |
| 11.6 | 43 | 0.2% |
| 12.5 | 42 | 0.2% |
| Other values (1355) | 19559 |
| Value | Count | Frequency (%) |
| 3.2 | 1 | < 0.1% |
| 3.32 | 1 | < 0.1% |
| 3.66 | 1 | < 0.1% |
| 3.79 | 1 | < 0.1% |
| 3.81 | 3 | |
| 3.83 | 1 | < 0.1% |
| 3.89 | 2 | |
| 3.92 | 1 | < 0.1% |
| 3.98 | 2 | |
| 4.01 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.14 | 1 | |
| 3.2 | 1 | |
| 3.63 | 1 | |
| 3.79 | 1 | |
| 3.81 | 1 | |
| 3.92 | 1 | |
| 3.98 | 1 | |
| 4.11 | 1 | |
| 4.18 | 1 | |
| 4.29 | 1 |
| Value | Count | Frequency (%) |
| 3.14 | 1 | |
| 3.2 | 1 | |
| 3.63 | 1 | |
| 3.79 | 1 | |
| 3.81 | 1 | |
| 3.92 | 1 | |
| 3.98 | 1 | |
| 4.11 | 1 | |
| 4.18 | 1 | |
| 4.29 | 1 |
| Value | Count | Frequency (%) |
| 3.2 | 1 | < 0.1% |
| 3.32 | 1 | < 0.1% |
| 3.66 | 1 | < 0.1% |
| 3.79 | 1 | < 0.1% |
| 3.81 | 3 | |
| 3.83 | 1 | < 0.1% |
| 3.89 | 2 | |
| 3.92 | 1 | < 0.1% |
| 3.98 | 2 | |
| 4.01 | 1 | < 0.1% |
gender
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| Female | |
|---|---|
| Male | |
| Other | 3728 |
| Female | |
|---|---|
| Male | |
| Other | 430 |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 5.0371788 | 5.0249 |
| Min length | 4 | 4 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | Female | Male |
| 2nd row | Male | Female |
| 3rd row | Male | Female |
| 4th row | Female | Female |
| 5th row | Male | Other |
Common Values
| Value | Count | Frequency (%) |
| Female | 306175 | |
| Male | 284091 | |
| Other | 3728 | 0.6% |
| Value | Count | Frequency (%) |
| Female | 10034 | |
| Male | 9536 | |
| Other | 430 | 2.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| female | 306175 | |
| male | 284091 | |
| other | 3728 | 0.6% |
| Value | Count | Frequency (%) |
| female | 10034 | |
| male | 9536 | |
| other | 430 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 30034 | |
| a | 19570 | |
| l | 19570 | |
| F | 10034 | 10.0% |
| m | 10034 | 10.0% |
| M | 9536 | 9.5% |
| O | 430 | 0.4% |
| t | 430 | 0.4% |
| h | 430 | 0.4% |
| r | 430 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 100498 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 30034 | |
| a | 19570 | |
| l | 19570 | |
| F | 10034 | 10.0% |
| m | 10034 | 10.0% |
| M | 9536 | 9.5% |
| O | 430 | 0.4% |
| t | 430 | 0.4% |
| h | 430 | 0.4% |
| r | 430 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 100498 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 30034 | |
| a | 19570 | |
| l | 19570 | |
| F | 10034 | 10.0% |
| m | 10034 | 10.0% |
| M | 9536 | 9.5% |
| O | 430 | 0.4% |
| t | 430 | 0.4% |
| h | 430 | 0.4% |
| r | 430 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 100498 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 30034 | |
| a | 19570 | |
| l | 19570 | |
| F | 10034 | 10.0% |
| m | 10034 | 10.0% |
| M | 9536 | 9.5% |
| O | 430 | 0.4% |
| t | 430 | 0.4% |
| h | 430 | 0.4% |
| r | 430 | 0.4% |
marital_status
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| Single | |
|---|---|
| Married | |
| Divorced | 21312 |
| Widowed | 6600 |
| Single | |
|---|---|
| Married | |
| Divorced | |
| Widowed | 567 |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 8 | 8 |
| Median length | 7 | 7 |
| Mean length | 6.5496066 | 6.61985 |
| Min length | 6 | 6 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | Single | Married |
| 2nd row | Married | Married |
| 3rd row | Single | Single |
| 4th row | Single | Single |
| 5th row | Married | Single |
Common Values
| Value | Count | Frequency (%) |
| Single | 288843 | |
| Married | 277239 | |
| Divorced | 21312 | 3.6% |
| Widowed | 6600 | 1.1% |
| Value | Count | Frequency (%) |
| Single | 9031 | |
| Married | 8974 | |
| Divorced | 1428 | 7.1% |
| Widowed | 567 | 2.8% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| single | 288843 | |
| married | 277239 | |
| divorced | 21312 | 3.6% |
| widowed | 6600 | 1.1% |
| Value | Count | Frequency (%) |
| single | 9031 | |
| married | 8974 | |
| divorced | 1428 | 7.1% |
| widowed | 567 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 20000 | |
| e | 20000 | |
| r | 19376 | |
| d | 11536 | |
| g | 9031 | |
| l | 9031 | |
| n | 9031 | |
| S | 9031 | |
| a | 8974 | |
| M | 8974 | |
| Other values (6) | 7413 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 132397 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 20000 | |
| e | 20000 | |
| r | 19376 | |
| d | 11536 | |
| g | 9031 | |
| l | 9031 | |
| n | 9031 | |
| S | 9031 | |
| a | 8974 | |
| M | 8974 | |
| Other values (6) | 7413 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 132397 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 20000 | |
| e | 20000 | |
| r | 19376 | |
| d | 11536 | |
| g | 9031 | |
| l | 9031 | |
| n | 9031 | |
| S | 9031 | |
| a | 8974 | |
| M | 8974 | |
| Other values (6) | 7413 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 132397 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 20000 | |
| e | 20000 | |
| r | 19376 | |
| d | 11536 | |
| g | 9031 | |
| l | 9031 | |
| n | 9031 | |
| S | 9031 | |
| a | 8974 | |
| M | 8974 | |
| Other values (6) | 7413 | 5.6% |
education_level
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| Bachelor's | |
|---|---|
| High School | |
| Master's | |
| Other | 26677 |
| PhD | 11022 |
| Bachelor's | |
|---|---|
| High School | |
| Master's | |
| Other | |
| PhD | 804 |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 10 | 10 |
| Mean length | 9.6411731 | 9.26515 |
| Min length | 3 | 3 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | High School | Master's |
| 2nd row | Master's | Bachelor's |
| 3rd row | High School | High School |
| 4th row | High School | High School |
| 5th row | High School | Other |
Common Values
| Value | Count | Frequency (%) |
| Bachelor's | 279606 | |
| High School | 183592 | |
| Master's | 93097 | 15.7% |
| Other | 26677 | 4.5% |
| PhD | 11022 | 1.9% |
| Value | Count | Frequency (%) |
| Bachelor's | 8045 | |
| High School | 5919 | |
| Master's | 3724 | |
| Other | 1508 | 7.5% |
| PhD | 804 | 4.0% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| bachelor's | 279606 | |
| high | 183592 | |
| school | 183592 | |
| master's | 93097 | 12.0% |
| other | 26677 | 3.4% |
| phd | 11022 | 1.4% |
| Value | Count | Frequency (%) |
| bachelor's | 8045 | |
| high | 5919 | |
| school | 5919 | |
| master's | 3724 | |
| other | 1508 | 5.8% |
| phd | 804 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 22195 | |
| o | 19883 | |
| s | 15493 | 8.4% |
| c | 13964 | 7.5% |
| l | 13964 | 7.5% |
| e | 13277 | 7.2% |
| r | 13277 | 7.2% |
| a | 11769 | 6.4% |
| ' | 11769 | 6.4% |
| B | 8045 | 4.3% |
| Other values (10) | 41667 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 185303 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 22195 | |
| o | 19883 | |
| s | 15493 | 8.4% |
| c | 13964 | 7.5% |
| l | 13964 | 7.5% |
| e | 13277 | 7.2% |
| r | 13277 | 7.2% |
| a | 11769 | 6.4% |
| ' | 11769 | 6.4% |
| B | 8045 | 4.3% |
| Other values (10) | 41667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 185303 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 22195 | |
| o | 19883 | |
| s | 15493 | 8.4% |
| c | 13964 | 7.5% |
| l | 13964 | 7.5% |
| e | 13277 | 7.2% |
| r | 13277 | 7.2% |
| a | 11769 | 6.4% |
| ' | 11769 | 6.4% |
| B | 8045 | 4.3% |
| Other values (10) | 41667 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 185303 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 22195 | |
| o | 19883 | |
| s | 15493 | 8.4% |
| c | 13964 | 7.5% |
| l | 13964 | 7.5% |
| e | 13277 | 7.2% |
| r | 13277 | 7.2% |
| a | 11769 | 6.4% |
| ' | 11769 | 6.4% |
| B | 8045 | 4.3% |
| Other values (10) | 41667 |
employment_status
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| Employed | |
|---|---|
| Unemployed | |
| Self-employed | |
| Retired | 16453 |
| Student | 11931 |
| Employed | |
|---|---|
| Self-employed | |
| Unemployed | |
| Retired | 1176 |
| Student | 781 |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 8 | 8 |
| Mean length | 8.6043596 | 8.8442 |
| Min length | 7 | 7 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | Self-employed | Employed |
| 2nd row | Employed | Employed |
| 3rd row | Employed | Employed |
| 4th row | Employed | Employed |
| 5th row | Employed | Employed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 450645 | |
| Unemployed | 62485 | 10.5% |
| Self-employed | 52480 | 8.8% |
| Retired | 16453 | 2.8% |
| Student | 11931 | 2.0% |
| Value | Count | Frequency (%) |
| Employed | 13007 | |
| Self-employed | 2923 | 14.6% |
| Unemployed | 2113 | 10.6% |
| Retired | 1176 | 5.9% |
| Student | 781 | 3.9% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| employed | 450645 | |
| unemployed | 62485 | 10.5% |
| self-employed | 52480 | 8.8% |
| retired | 16453 | 2.8% |
| student | 11931 | 2.0% |
| Value | Count | Frequency (%) |
| employed | 13007 | |
| self-employed | 2923 | 14.6% |
| unemployed | 2113 | 10.6% |
| retired | 1176 | 5.9% |
| student | 781 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 29135 | |
| l | 20966 | |
| d | 20000 | |
| m | 18043 | |
| y | 18043 | |
| p | 18043 | |
| o | 18043 | |
| E | 13007 | |
| S | 3704 | 2.1% |
| f | 2923 | 1.7% |
| Other values (8) | 14977 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 176884 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 29135 | |
| l | 20966 | |
| d | 20000 | |
| m | 18043 | |
| y | 18043 | |
| p | 18043 | |
| o | 18043 | |
| E | 13007 | |
| S | 3704 | 2.1% |
| f | 2923 | 1.7% |
| Other values (8) | 14977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 176884 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 29135 | |
| l | 20966 | |
| d | 20000 | |
| m | 18043 | |
| y | 18043 | |
| p | 18043 | |
| o | 18043 | |
| E | 13007 | |
| S | 3704 | 2.1% |
| f | 2923 | 1.7% |
| Other values (8) | 14977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 176884 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 29135 | |
| l | 20966 | |
| d | 20000 | |
| m | 18043 | |
| y | 18043 | |
| p | 18043 | |
| o | 18043 | |
| E | 13007 | |
| S | 3704 | 2.1% |
| f | 2923 | 1.7% |
| Other values (8) | 14977 |
loan_purpose
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 8 | 8 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| Debt consolidation | |
|---|---|
| Other | |
| Car | |
| Home | |
| Education | |
| Other values (3) |
| Debt consolidation | |
|---|---|
| Other | |
| Car | |
| Home | |
| Education | |
| Other values (3) |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 18 | 18 |
| Median length | 18 | 9 |
| Mean length | 12.38077 | 10.64005 |
| Min length | 3 | 3 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | Other | Car |
| 2nd row | Debt consolidation | Debt consolidation |
| 3rd row | Debt consolidation | Business |
| 4th row | Debt consolidation | Other |
| 5th row | Other | Car |
Common Values
| Value | Count | Frequency (%) |
| Debt consolidation | 324695 | |
| Other | 63874 | 10.8% |
| Car | 58108 | 9.8% |
| Home | 44118 | 7.4% |
| Education | 36641 | 6.2% |
| Business | 35303 | 5.9% |
| Medical | 22806 | 3.8% |
| Vacation | 8449 | 1.4% |
| Value | Count | Frequency (%) |
| Debt consolidation | 7981 | |
| Other | 2550 | 12.8% |
| Car | 2390 | 11.9% |
| Home | 1972 | 9.9% |
| Education | 1675 | 8.4% |
| Business | 1629 | 8.1% |
| Medical | 1196 | 6.0% |
| Vacation | 607 | 3.0% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| debt | 324695 | |
| consolidation | 324695 | |
| other | 63874 | 7.0% |
| car | 58108 | 6.3% |
| home | 44118 | 4.8% |
| education | 36641 | 4.0% |
| business | 35303 | 3.8% |
| medical | 22806 | 2.5% |
| vacation | 8449 | 0.9% |
| Value | Count | Frequency (%) |
| debt | 7981 | |
| consolidation | 7981 | |
| other | 2550 | 9.1% |
| car | 2390 | 8.5% |
| home | 1972 | 7.0% |
| education | 1675 | 6.0% |
| business | 1629 | 5.8% |
| medical | 1196 | 4.3% |
| vacation | 607 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 28197 | |
| i | 21069 | |
| t | 20794 | |
| n | 19873 | |
| e | 15328 | 7.2% |
| a | 14456 | 6.8% |
| s | 12868 | 6.0% |
| c | 11459 | 5.4% |
| d | 10852 | 5.1% |
| l | 9177 | 4.3% |
| Other values (14) | 48728 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 212801 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 28197 | |
| i | 21069 | |
| t | 20794 | |
| n | 19873 | |
| e | 15328 | 7.2% |
| a | 14456 | 6.8% |
| s | 12868 | 6.0% |
| c | 11459 | 5.4% |
| d | 10852 | 5.1% |
| l | 9177 | 4.3% |
| Other values (14) | 48728 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 212801 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 28197 | |
| i | 21069 | |
| t | 20794 | |
| n | 19873 | |
| e | 15328 | 7.2% |
| a | 14456 | 6.8% |
| s | 12868 | 6.0% |
| c | 11459 | 5.4% |
| d | 10852 | 5.1% |
| l | 9177 | 4.3% |
| Other values (14) | 48728 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 212801 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 28197 | |
| i | 21069 | |
| t | 20794 | |
| n | 19873 | |
| e | 15328 | 7.2% |
| a | 14456 | 6.8% |
| s | 12868 | 6.0% |
| c | 11459 | 5.4% |
| d | 10852 | 5.1% |
| l | 9177 | 4.3% |
| Other values (14) | 48728 |
grade_subgrade
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 30 | 30 |
| Distinct (%) | < 0.1% | 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| C3 | |
|---|---|
| C4 | |
| C2 | |
| C1 | |
| C5 | |
| Other values (25) |
| C3 | |
|---|---|
| C4 | |
| C2 | |
| C5 | |
| C1 | |
| Other values (25) |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 2 | 2 |
| Median length | 2 | 2 |
| Mean length | 2 | 2 |
| Min length | 2 | 2 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | C3 | B5 |
| 2nd row | D3 | F1 |
| 3rd row | C5 | B4 |
| 4th row | F1 | A5 |
| 5th row | D1 | D5 |
Common Values
| Value | Count | Frequency (%) |
| C3 | 58695 | |
| C4 | 55957 | 9.4% |
| C2 | 54443 | 9.2% |
| C1 | 53363 | 9.0% |
| C5 | 53317 | 9.0% |
| D1 | 37029 | 6.2% |
| D3 | 36694 | 6.2% |
| D4 | 35097 | 5.9% |
| D2 | 34432 | 5.8% |
| D5 | 32101 | 5.4% |
| Other values (20) | 142866 |
| Value | Count | Frequency (%) |
| C3 | 1514 | 7.6% |
| C4 | 1463 | 7.3% |
| C2 | 1436 | 7.2% |
| C5 | 1422 | 7.1% |
| C1 | 1410 | 7.0% |
| D1 | 1155 | 5.8% |
| D3 | 1146 | 5.7% |
| D4 | 1100 | 5.5% |
| D2 | 1091 | 5.5% |
| D5 | 1073 | 5.4% |
| Other values (20) | 7190 |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)Original Data
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| c3 | 58695 | |
| c4 | 55957 | 9.4% |
| c2 | 54443 | 9.2% |
| c1 | 53363 | 9.0% |
| c5 | 53317 | 9.0% |
| d1 | 37029 | 6.2% |
| d3 | 36694 | 6.2% |
| d4 | 35097 | 5.9% |
| d2 | 34432 | 5.8% |
| d5 | 32101 | 5.4% |
| Other values (20) | 142866 |
| Value | Count | Frequency (%) |
| c3 | 1514 | 7.6% |
| c4 | 1463 | 7.3% |
| c2 | 1436 | 7.2% |
| c5 | 1422 | 7.1% |
| c1 | 1410 | 7.0% |
| d1 | 1155 | 5.8% |
| d3 | 1146 | 5.7% |
| d4 | 1100 | 5.5% |
| d2 | 1091 | 5.5% |
| d5 | 1073 | 5.4% |
| Other values (20) | 7190 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 7245 | |
| D | 5565 | |
| 3 | 4078 | |
| 4 | 4016 | |
| 2 | 3983 | |
| 1 | 3981 | |
| 5 | 3942 | |
| B | 3075 | |
| E | 1695 | 4.2% |
| F | 1551 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 7245 | |
| D | 5565 | |
| 3 | 4078 | |
| 4 | 4016 | |
| 2 | 3983 | |
| 1 | 3981 | |
| 5 | 3942 | |
| B | 3075 | |
| E | 1695 | 4.2% |
| F | 1551 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 7245 | |
| D | 5565 | |
| 3 | 4078 | |
| 4 | 4016 | |
| 2 | 3983 | |
| 1 | 3981 | |
| 5 | 3942 | |
| B | 3075 | |
| E | 1695 | 4.2% |
| F | 1551 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 7245 | |
| D | 5565 | |
| 3 | 4078 | |
| 4 | 4016 | |
| 2 | 3983 | |
| 1 | 3981 | |
| 5 | 3942 | |
| B | 3075 | |
| E | 1695 | 4.2% |
| F | 1551 | 3.9% |
loan_paid_back
Categorical
| Training Data | Original Data | |
|---|---|---|
| Distinct | 2 | 2 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 156.4 KiB |
| 1.0 | |
|---|---|
| 0.0 |
| 1 | |
|---|---|
| 0 |
Length
| Training Data | Original Data | |
|---|---|---|
| Max length | 3 | 1 |
| Median length | 3 | 1 |
| Mean length | 3 | 1 |
| Min length | 3 | 1 |
Unique
| Training Data | Original Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Original Data | |
|---|---|---|
| 1st row | 1.0 | 1 |
| 2nd row | 0.0 | 1 |
| 3rd row | 1.0 | 1 |
| 4th row | 1.0 | 1 |
| 5th row | 1.0 | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 474494 | |
| 0.0 | 119500 | 20.1% |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Original Data
| Value | Count | Frequency (%) |
| 1.0 | 474494 | |
| 0.0 | 119500 | 20.1% |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
| Value | Count | Frequency (%) |
| (unknown) | 20000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
| Value | Count | Frequency (%) |
| (unknown) | 20000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
| Value | Count | Frequency (%) |
| (unknown) | 20000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
| Value | Count | Frequency (%) |
| 1 | 15998 | |
| 0 | 4002 | 20.0% |
Interactions
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Training Data
Original Data
Correlations
Training Data
Original Data
Training Data
| annual_income | credit_score | debt_to_income_ratio | education_level | employment_status | gender | grade_subgrade | interest_rate | loan_amount | loan_paid_back | loan_purpose | marital_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| annual_income | 1.000 | 0.004 | 0.005 | 0.008 | 0.009 | 0.004 | 0.007 | -0.003 | -0.009 | 0.020 | 0.007 | 0.010 |
| credit_score | 0.004 | 1.000 | -0.060 | 0.007 | 0.051 | 0.008 | 0.638 | -0.517 | -0.008 | 0.232 | 0.008 | 0.011 |
| debt_to_income_ratio | 0.005 | -0.060 | 1.000 | 0.006 | 0.088 | 0.004 | 0.024 | 0.026 | -0.012 | 0.334 | 0.006 | 0.004 |
| education_level | 0.008 | 0.007 | 0.006 | 1.000 | 0.012 | 0.004 | 0.013 | 0.008 | 0.005 | 0.025 | 0.011 | 0.008 |
| employment_status | 0.009 | 0.051 | 0.088 | 0.012 | 1.000 | 0.003 | 0.052 | 0.025 | 0.010 | 0.657 | 0.015 | 0.006 |
| gender | 0.004 | 0.008 | 0.004 | 0.004 | 0.003 | 1.000 | 0.009 | 0.004 | 0.010 | 0.007 | 0.007 | 0.002 |
| grade_subgrade | 0.007 | 0.638 | 0.024 | 0.013 | 0.052 | 0.009 | 1.000 | 0.192 | 0.013 | 0.228 | 0.008 | 0.013 |
| interest_rate | -0.003 | -0.517 | 0.026 | 0.008 | 0.025 | 0.004 | 0.192 | 1.000 | -0.001 | 0.129 | 0.006 | 0.006 |
| loan_amount | -0.009 | -0.008 | -0.012 | 0.005 | 0.010 | 0.010 | 0.013 | -0.001 | 1.000 | 0.013 | 0.008 | 0.008 |
| loan_paid_back | 0.020 | 0.232 | 0.334 | 0.025 | 0.657 | 0.007 | 0.228 | 0.129 | 0.013 | 1.000 | 0.025 | 0.001 |
| loan_purpose | 0.007 | 0.008 | 0.006 | 0.011 | 0.015 | 0.007 | 0.008 | 0.006 | 0.008 | 0.025 | 1.000 | 0.010 |
| marital_status | 0.010 | 0.011 | 0.004 | 0.008 | 0.006 | 0.002 | 0.013 | 0.006 | 0.008 | 0.001 | 0.010 | 1.000 |
Original Data
| annual_income | credit_score | debt_to_income_ratio | education_level | employment_status | gender | grade_subgrade | interest_rate | loan_amount | loan_paid_back | loan_purpose | marital_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| annual_income | 1.000 | 0.005 | -0.003 | 0.000 | 0.004 | 0.000 | 0.004 | -0.006 | 0.002 | 0.029 | 0.000 | 0.006 |
| credit_score | 0.005 | 1.000 | -0.025 | 0.000 | 0.000 | 0.000 | 0.634 | -0.551 | 0.008 | 0.198 | 0.000 | 0.004 |
| debt_to_income_ratio | -0.003 | -0.025 | 1.000 | 0.004 | 0.012 | 0.023 | 0.011 | 0.007 | -0.008 | 0.220 | 0.005 | 0.000 |
| education_level | 0.000 | 0.000 | 0.004 | 1.000 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 0.010 | 0.004 |
| employment_status | 0.004 | 0.000 | 0.012 | 0.005 | 1.000 | 0.000 | 0.006 | 0.000 | 0.008 | 0.584 | 0.000 | 0.000 |
| gender | 0.000 | 0.000 | 0.023 | 0.000 | 0.000 | 1.000 | 0.000 | 0.010 | 0.016 | 0.000 | 0.000 | 0.000 |
| grade_subgrade | 0.004 | 0.634 | 0.011 | 0.000 | 0.006 | 0.000 | 1.000 | 0.206 | 0.000 | 0.192 | 0.004 | 0.000 |
| interest_rate | -0.006 | -0.551 | 0.007 | 0.000 | 0.000 | 0.010 | 0.206 | 1.000 | -0.009 | 0.109 | 0.008 | 0.000 |
| loan_amount | 0.002 | 0.008 | -0.008 | 0.000 | 0.008 | 0.016 | 0.000 | -0.009 | 1.000 | 0.000 | 0.000 | 0.008 |
| loan_paid_back | 0.029 | 0.198 | 0.220 | 0.018 | 0.584 | 0.000 | 0.192 | 0.109 | 0.000 | 1.000 | 0.021 | 0.000 |
| loan_purpose | 0.000 | 0.000 | 0.005 | 0.010 | 0.000 | 0.000 | 0.004 | 0.008 | 0.000 | 0.021 | 1.000 | 0.011 |
| marital_status | 0.006 | 0.004 | 0.000 | 0.004 | 0.000 | 0.000 | 0.000 | 0.000 | 0.008 | 0.000 | 0.011 | 1.000 |
Missing values
Training Data
A simple visualization of nullity by column.
Original Data
A simple visualization of nullity by column.
Training Data
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Original Data
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 29,367.990 | 0.084 | 736 | 2,528.420 | 13.670 | Female | Single | High School | Self-employed | Other | C3 | 1.000 |
| 1 | 22,108.020 | 0.166 | 636 | 4,593.100 | 12.920 | Male | Married | Master's | Employed | Debt consolidation | D3 | 0.000 |
| 2 | 49,566.200 | 0.097 | 694 | 17,005.150 | 9.760 | Male | Single | High School | Employed | Debt consolidation | C5 | 1.000 |
| 3 | 46,858.250 | 0.065 | 533 | 4,682.480 | 16.100 | Female | Single | High School | Employed | Debt consolidation | F1 | 1.000 |
| 4 | 25,496.700 | 0.053 | 665 | 12,184.430 | 10.210 | Male | Married | High School | Employed | Other | D1 | 1.000 |
| 5 | 44,940.300 | 0.058 | 653 | 12,159.920 | 12.240 | Male | Single | Bachelor's | Employed | Other | D1 | 1.000 |
| 6 | 61,574.160 | 0.042 | 696 | 16,907.710 | 13.520 | Other | Single | High School | Self-employed | Debt consolidation | C5 | 1.000 |
| 7 | 45,953.310 | 0.100 | 654 | 10,111.620 | 12.820 | Female | Married | High School | Employed | Home | D1 | 1.000 |
| 8 | 30,592.290 | 0.132 | 713 | 7,522.360 | 9.480 | Male | Married | Bachelor's | Employed | Education | C5 | 1.000 |
| 9 | 17,342.450 | 0.121 | 548 | 9,653.480 | 16.040 | Female | Married | Bachelor's | Self-employed | Vacation | F1 | 1.000 |
Original Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 24,240.190 | 0.074 | 743 | 17,173.720 | 13.390 | Male | Married | Master's | Employed | Car | B5 | 1 |
| 1 | 20,172.980 | 0.219 | 531 | 22,663.890 | 17.810 | Female | Married | Bachelor's | Employed | Debt consolidation | F1 | 1 |
| 2 | 26,181.800 | 0.234 | 779 | 3,631.360 | 9.530 | Female | Single | High School | Employed | Business | B4 | 1 |
| 3 | 11,873.840 | 0.264 | 809 | 14,939.230 | 7.990 | Female | Single | High School | Employed | Other | A5 | 1 |
| 4 | 25,326.440 | 0.260 | 663 | 16,551.710 | 15.200 | Other | Single | Other | Employed | Car | D5 | 1 |
| 5 | 55,559.800 | 0.081 | 774 | 12,724.020 | 12.730 | Male | Single | High School | Employed | Debt consolidation | B1 | 1 |
| 6 | 24,642.880 | 0.165 | 742 | 5,905.270 | 12.480 | Male | Single | Bachelor's | Unemployed | Car | B3 | 0 |
| 7 | 52,610.690 | 0.135 | 810 | 15,136.350 | 8.450 | Female | Single | Bachelor's | Employed | Debt consolidation | A3 | 1 |
| 8 | 62,922.050 | 0.074 | 724 | 500.000 | 9.950 | Other | Single | High School | Employed | Home | C4 | 1 |
| 9 | 53,439.890 | 0.375 | 796 | 14,712.380 | 11.810 | Female | Single | High School | Employed | Car | B1 | 0 |
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 593984 | 36,169.340 | 0.091 | 676 | 9,986.830 | 14.180 | Female | Married | Bachelor's | Retired | Debt consolidation | C3 | 1.000 |
| 593985 | 37,188.430 | 0.170 | 718 | 17,056.520 | 10.470 | Female | Married | Bachelor's | Employed | Home | C3 | 1.000 |
| 593986 | 25,015.350 | 0.074 | 633 | 15,922.610 | 13.910 | Male | Married | Bachelor's | Employed | Debt consolidation | D2 | 0.000 |
| 593987 | 17,662.680 | 0.074 | 679 | 19,792.920 | 15.480 | Female | Single | Other | Employed | Debt consolidation | C3 | 1.000 |
| 593988 | 15,602.220 | 0.056 | 622 | 25,706.470 | 15.750 | Female | Married | High School | Employed | Debt consolidation | D2 | 1.000 |
| 593989 | 23,004.260 | 0.152 | 703 | 20,958.370 | 10.920 | Female | Single | High School | Employed | Business | C3 | 1.000 |
| 593990 | 35,289.430 | 0.105 | 559 | 3,257.240 | 14.620 | Male | Single | Bachelor's | Employed | Debt consolidation | F5 | 1.000 |
| 593991 | 47,112.640 | 0.072 | 675 | 929.270 | 14.130 | Female | Married | Bachelor's | Employed | Debt consolidation | C1 | 1.000 |
| 593992 | 76,748.440 | 0.067 | 740 | 16,290.400 | 9.870 | Male | Single | Bachelor's | Employed | Debt consolidation | B2 | 1.000 |
| 593993 | 48,959.520 | 0.096 | 752 | 7,707.730 | 10.310 | Male | Married | High School | Employed | Education | B3 | 1.000 |
Original Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19990 | 33,789.750 | 0.049 | 641 | 14,301.950 | 11.310 | Female | Divorced | High School | Employed | Home | D5 | 1 |
| 19991 | 20,111.610 | 0.067 | 740 | 8,861.860 | 8.300 | Male | Married | High School | Employed | Home | B4 | 1 |
| 19992 | 37,651.770 | 0.278 | 664 | 11,494.360 | 12.000 | Male | Single | High School | Employed | Other | D3 | 1 |
| 19993 | 24,082.300 | 0.178 | 674 | 6,877.770 | 13.820 | Male | Married | PhD | Retired | Debt consolidation | C3 | 1 |
| 19994 | 44,960.330 | 0.176 | 792 | 13,300.710 | 11.990 | Female | Married | Bachelor's | Self-employed | Medical | B4 | 1 |
| 19995 | 39,640.080 | 0.275 | 691 | 16,322.230 | 15.050 | Female | Married | Bachelor's | Employed | Debt consolidation | C5 | 0 |
| 19996 | 32,062.900 | 0.367 | 758 | 16,697.340 | 11.890 | Female | Married | Bachelor's | Employed | Debt consolidation | B5 | 1 |
| 19997 | 18,642.020 | 0.106 | 751 | 23,924.780 | 10.060 | Female | Single | Master's | Student | Debt consolidation | B4 | 1 |
| 19998 | 22,181.390 | 0.275 | 646 | 16,920.130 | 16.060 | Male | Married | Master's | Retired | Other | D2 | 1 |
| 19999 | 23,737.700 | 0.228 | 630 | 15,769.750 | 13.070 | Female | Married | Other | Employed | Business | D2 | 0 |
Duplicate rows
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | |||||||||||||
Original Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | |||||||||||||